An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
نویسندگان
چکیده
1 School of Computer and Information Technology, Liaoning Normal University, Dalian 116029, China 2 State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China 3Department of Engineering, Faculty of Engineering and Science, University of Agder, 4898 Grimstad, Norway 4College of Engineering and Science, Victoria University, Melbourne, VIC 8001, Australia 5 School of Electrical and Electronic Engineering, The University of Adelaide, Adelaide, SA 5005, Australia
منابع مشابه
Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning
Since most real-world applications of classification learning involve continuous-valued attributes, properly addressing the discretization process is an important problem. This paper addresses the use of the entropy minimization heuristic for discretizing the range of a continuous-valued attribute into multiple intervals. We briefly present theoretical evidence for the appropriateness of this h...
متن کاملInterval Similarity-Based Quantization Method for Continuous Data
Data quantization methods for continuous attributes play an extremely important role in artificial intelligence, data mining and machine learning because discrete values of attributes are required in most classification methods. In this paper, we present an interval similarity-based quantization method for continuous data. It defines an interval similarity criterion which is regarded as a new m...
متن کاملA Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset
Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationary and uncertainty of the signals should be used for ECG analysis. Ex...
متن کاملGenetic Fuzzy Discretization for Classification Problems
Many real-world classification algorithms can not be applied unless the continuous attributes are discretized and the interval discretization methods are used in many machine learning techniques. It is hard to determine the intervals for the discretization of numerical attributes that has an infinite number of candidates. And interval discretization methods are based on a crisp set, a value in ...
متن کاملThe Development of the Generalization Algorithm Based on the Rough Set Theory
This paper considers the problem of concept generalization in decision-making systems where such features of real-world databases as large size, incompleteness and inconsistence of the stored information are taken into account. The methods of the rough set theory (like lower and upper approximations, positive regions and reducts) are used for the solving of this problem. The new discretization ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Applied Mathematics
دوره 2013 شماره
صفحات -
تاریخ انتشار 2013